Korpus: slk_newscrawl_2011

Weitere Korpora

3.7.1 String similarity graph for words

General information for Levenshtein distance for words. The data are considered as graph with words as nodes and edges weighted with Levenshtein similarity. Only the top 1.000.000 words are considered.

Number of nodes in the top-1M Levenshtein graph
Number of nodes
78090
Number of edges in the top-1M Levenshtein graph
Number of edges
292812
Minimum word length
Word length
4
Edge weights in the top-1M Levenshtein graph
Levenshtein distance Number of edgeses
0 10577
1 87033
2 195202
785 msec needed at 2017-10-20 21:20